PlantTFDB
Plant Transcription Factor Database
v4.0
Previous version: v3.0
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Csa14g026700.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Brassicales; Brassicaceae; Camelineae; Camelina
Family GRAS
Protein Properties Length: 596aa    MW: 66865.5 Da    PI: 4.7638
Description GRAS family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Csa14g026700.1genomeCSGPView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1GRAS442.52.9e-1352265953374
            GRAS   3 elLlecAeavssgdlelaqalLarlselaspdgdpmqRlaayfteALaarlarsvselykalppsetseknsseelaalklfsevsPilkfshlt 97 
                     + L++cA+a+s+g+ e+a +++++l++++s +gdp qR+aay++e+Laar+a s++ +y+al+++e +   s+e+laa+++++ev+P++kf++l+
  Csa14g026700.1 226 QILISCARALSEGKSEEALSMVNELRQIVSIQGDPSQRIAAYMVEGLAARMAASGKFIYRALKCKEPP---SDERLAAMQVLFEVCPCFKFGFLA 317
                     78*****************************************************************9...9*********************** PP

            GRAS  98 aNqaIleavegeervHiiDfdisqGlQWpaLlqaLasRpegppslRiTgvgspesg..skeeleetgerLakfAeelgvpfefnvlvakrledle 190
                     aN aI ea++gee+vHiiDfdi+qG Q+++L++++a+ p++ p+lR+Tg+++pes+  s   l+ +g rL+++A+  gv+f+f++ v ++++ ++
  Csa14g026700.1 318 ANGAIIEAIKGEEAVHIIDFDINQGNQYMTLIRSIAELPGKRPRLRLTGIDDPESVqrSIGGLSIIGLRLEQLAKDHGVSFKFKA-VPSKTSIVS 411
                     ******************************************************9988899************************.7******** PP

            GRAS 191 leeLrvkpgEalaVnlvlqlhrlldesvsleserdevLklvkslsPkvvvvveqeadhnsesFlerflealeyysalfdsleaklpreseerikv 285
                     +++L +kpgE+l+Vn+++qlh+++desv++ ++rde+L++vksl+Pk+v+vveq++++n+++F+ rf+ea eyysa+fdsl+++lpres+er++v
  Csa14g026700.1 412 PSTLGCKPGETLIVNFAFQLHHMPDESVTTVNQRDELLHMVKSLNPKLVTVVEQDVNTNTSPFFSRFIEAYEYYSAVFDSLDMTLPRESQERMNV 506
                     *********************************************************************************************** PP

            GRAS 286 ErellgreivnvvacegaerrerhetlekWrerleeaGFkpvplsekaakqaklllrkvksdgyrveeesgslvlgWkdrpLvsvSaWr 374
                     Er++l+r+ivn+vaceg+er+er e ++kWr+r+++aGF+p p+s++++++++ l+++ + + y+++ee g+l ++W++++L+++SaWr
  Csa14g026700.1 507 ERQCLARDIVNIVACEGEERIERYEAAGKWRARMMMAGFSPKPMSSRVTNNIQNLIKQQYCNNYMLKEEMGELHFCWEEKSLIVASAWR 595
                     ************************************************************888*************************8 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
PROSITE profilePS5098564.945198575IPR005202Transcription factor GRAS
PfamPF035141.0E-132226595IPR005202Transcription factor GRAS
Sequence ? help Back to Top
Protein Sequence    Length: 596 aa     Download sequence    Send to blast
MVEQTVVREH IKARIMSLVR SAEPSSYRNP KLYSLNENVN NIGGVTSAQI FDQDRSKNPC  60
LTDDSYPSQS YEKYFLDSPT DEFVQQHPIG SGASVSSFGS LDSFPYQSRP VLGCSMEFQL  120
PFDSTSTSST RPLGDYQAVS YSPSMDVVEE FDDEQMRSKI QELERALLGD EDDKMVGIDN  180
LMEIDNEWSY QNESEQHQDS PKESSSADSN SHVSSKEVVS QTTPKQILIS CARALSEGKS  240
EEALSMVNEL RQIVSIQGDP SQRIAAYMVE GLAARMAASG KFIYRALKCK EPPSDERLAA  300
MQVLFEVCPC FKFGFLAANG AIIEAIKGEE AVHIIDFDIN QGNQYMTLIR SIAELPGKRP  360
RLRLTGIDDP ESVQRSIGGL SIIGLRLEQL AKDHGVSFKF KAVPSKTSIV SPSTLGCKPG  420
ETLIVNFAFQ LHHMPDESVT TVNQRDELLH MVKSLNPKLV TVVEQDVNTN TSPFFSRFIE  480
AYEYYSAVFD SLDMTLPRES QERMNVERQC LARDIVNIVA CEGEERIERY EAAGKWRARM  540
MMAGFSPKPM SSRVTNNIQN LIKQQYCNNY MLKEEMGELH FCWEEKSLIV ASAWR*
3D Structure ? help Back to Top
Structure
PDB ID Evalue Query Start Query End Hit Start Hit End Description
5hyz_A2e-562265956375GRAS family transcription factor containing p
Search in ModeBase
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Nucleotide ? help Back to Top
Source Hit ID E-value Description
GenBankAC0154470.0AC015447.8 Arabidopsis thaliana chromosome I BAC F24J8 genomic sequence, complete sequence.
GenBankAY0458330.0AY045833.1 Arabidopsis thaliana putative scarecrow 1 protein (At1g21450) mRNA, complete cds.
GenBankAY0966260.0AY096626.1 Arabidopsis thaliana unknown protein (At1g21450) mRNA, complete cds.
GenBankCP0026840.0CP002684.1 Arabidopsis thaliana chromosome 1 sequence.
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_010459856.10.0PREDICTED: scarecrow-like protein 1
SwissprotQ9SDQ30.0SCL1_ARATH; Scarecrow-like protein 1
TrEMBLD7KK120.0D7KK12_ARALL; Putative uncharacterized protein
STRINGfgenesh2_kg.1__2335__AT1G21450.10.0(Arabidopsis lyrata)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MalvidsOGEM69532744
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT1G21450.10.0SCARECROW-like 1